Abstract. We characterize the extremal points of the convex set of quantum
measurements that are covariant under a finite-dimensional projective
representation of a compact group, with action of the group on the measurement
probability space which is generally non-transitive. In this case the POVM
density is made of multiple orbits of positive operators, and, in the case of
extremal measurements, we provide a bound for the number of orbits and for the
rank of POVM elements. Two relevant applications are considered, concerning
state discrimination with mutually unbiased bases and the maximization of the
mutual information.

1. Introduction

A fundamental issue in the theory of quantum information is the investigation
of the ultimate precision limits for extracting classical information from a
quantum system. Indeed, when the information is encoded on quantum states, its
read-out suffers the intrinsically quantum limitation of discriminating among
nonorthogonal states. One then needs to optimize the discrimination with
respect to a given optimality criterion, which is dictated by the particular
task for which the measurement is designed, or by the particular way the
information is encoded over states. The good news is that, although the
formulation of the problem has limited generality due to the specific form of
the optimality criterion, for a large class of criteria the optimization method
is given by a standard procedure. In such an approach all possible measurements
form a convex set (the convex combination of two measurements corresponding to
the random choice between their apparatuses), and the optimization corresponds
to maximizing a convex functional (e.g. the mutual information [3]) or to
minimizing a concave functional (e.g. a Bayes cost [?]) over the convex set of
measurements. Since the global maximum of a convex functional (or the minimum
of a concave functional) is achieved over extremal points, the optimization can
be restricted to the extremal elements of the set only.

In most situations of interest, the set of signal states on which the
information is encoded is invariant under the unitary action of some group of
physical transformations. The symmetry of the set of signal states then
reflects into a symmetry of the optimal measurements, which without loss of
generality can be assumed to be covariant [?] with respect to the same group of
transformations.

The problem of characterizing extremal covariant measurements has been
addressed in Refs. [?], although the analysis there is restricted to the case
of a group action that is transitive on the probability space of measurement
outcomes, namely any two points in the probability space are connected by some
group element. The present paper completes the investigation by generalizing
all results to the case of non-transitive group actions. Indeed the
discrimination of states belonging to disjoint group orbits occurs in actual
applications, and this situation has received little attention in the
literature. Moreover, when classical information is encoded on quantum states
it can be convenient to decode it with a measurement having outcomes that are
not in one-to-one correspondence with the encoding states. This typically
happens when the optimality criterion is nonlinear in the probabilities of
measurement outcomes, as in the case of the mutual information. In the presence
of group symmetry, as recently noted by Decker [9], even if the encoding states
form a single group orbit, the maximization of the mutual information often
selects covariant measurements with probability space that splits into disjoint
orbits. It is then interesting to quantify the number of orbits needed for the
maximization of the mutual information, or at least to give an upper bound for
it. Indeed, as we will see in the present paper, the characterization of
extremal covariant measurements also provides as a byproduct an alternative and
simpler derivation of the bound given in [?].

2. Statement of the problem 

In the general framework of quantum mechanics the state of a system is
represented by a density operator $\rho$ on a given Hilbert space
$\mathcal{H}$, whereas the statistics of a measurement is described by a
positive operator valued measure (POVM), which associates a positive
semidefinite operator $P(B) \in \mathcal{B}(\mathcal{H})$ to any subset
$B \in \sigma(X)$ of the $\sigma$-algebra of events in the probability space
$X$. The defining properties for a POVM are:

(1)  $0 \le P(B) \le \mathbb{1} \quad \forall B \in \sigma(X)$

(2)  $P\left(\bigcup_{k=1}^{\infty} B_k\right) = \sum_{k=1}^{\infty} P(B_k) \quad \forall \{B_k\}$ disjoint

(3)  $P(X) = \mathbb{1}$ .

The probability of the event $B \in \sigma(X)$ is then given by the Born rule

(4)  $p(B) = \mathrm{Tr}[\rho P(B)]$ .
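
For a finite outcome set, properties (1)-(3) and the Born rule (4) can be checked numerically. The following is a minimal sketch under stated assumptions: the qubit "trine" POVM and the helper names `is_povm` and `born_probabilities` are illustrative choices, not objects defined in this paper.

```python
import numpy as np

def is_povm(elements, tol=1e-10):
    """Check positivity of each element and normalization of their sum."""
    dim = elements[0].shape[0]
    positive = all(np.linalg.eigvalsh(E).min() >= -tol for E in elements)
    normalized = np.allclose(sum(elements), np.eye(dim), atol=tol)
    return positive and normalized

def born_probabilities(rho, elements):
    """Return p(x) = Tr[rho P(x)] for every POVM element P(x)."""
    return np.array([np.trace(rho @ E).real for E in elements])

# Hypothetical example: three rank-one elements (2/3)|psi_k><psi_k| whose
# Bloch vectors are 120 degrees apart in a plane.
angles = 2 * np.pi * np.arange(3) / 3
kets = [np.array([np.cos(a / 2), np.sin(a / 2)]) for a in angles]
trine = [(2 / 3) * np.outer(k, k.conj()) for k in kets]

rho = np.array([[1.0, 0.0], [0.0, 0.0]])   # the pure state |0><0|
probs = born_probabilities(rho, trine)
```

The three probabilities sum to one precisely because the elements sum to the identity, which is property (3).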

In this paper we will consider the case where the probability space $X$
supports the action of a compact group $G$, namely any group element $g \in G$
acts as a measurable automorphism of the probability space $X$, which maps
$x \in X$ to $gx \in X$. If any two points $x_1, x_2 \in X$ are connected by
some group element, i.e. $x_2 = g x_1$ for some $g \in G$, the group action is
called transitive. In this case, which is the most studied in the literature
[?], the whole probability space is the group orbit of an arbitrary point
$x_0 \in X$, namely $X = \{g x_0 \mid g \in G\}$. In this paper we will study
the more general case where the group action is not transitive and,
accordingly, the probability space is not a single group orbit, but the union
of a set of disjoint orbits, each one being labeled by an index
$i \in \mathsf{I}$ for some set $\mathsf{I}$. For simplicity, we will assume
the index set $\mathsf{I}$ to be finite.

The simplest case of non-transitive group action then arises when the
probability space is the Cartesian product of the index set $\mathsf{I}$ with
the compact group $G$, i.e. $X = \mathsf{I} \times G$. In this case, the action
of a group element $h \in G$ on a point $x = (i,g) \in \mathsf{I} \times G$ is
given by $hx = (i, hg)$. Measurements with outcomes in $\mathsf{I} \times G$
naturally arise in the discrimination of a set of signal states which is the
union of a certain number of disjoint group orbits, each orbit $O_i$ being
generated by the action of the group on a given initial state $\rho_i$, namely
$O_i = \{U_g \rho_i U_g^\dagger \mid g \in G\}$ for some unitary representation
$R(G) = \{U_g \mid g \in G\}$. Precisely, if the stability group
$G_i = \{h \in G \mid U_h \rho_i U_h^\dagger = \rho_i\}$ associated to any
state $\rho_i$ consists only of the identity element $e$, then there is a
one-to-one correspondence between signal states and points of the probability
space $X = \mathsf{I} \times G$. In Section 4 we will study in detail the case
of POVMs with probability space $X = \mathsf{I} \times G$.

If the stability groups associated to the initial states
$\{\rho_i \mid i \in \mathsf{I}\}$ are nontrivial, namely $G_i \neq \{e\}$ for
some $i \in \mathsf{I}$, in order to have a one-to-one correspondence between
signal states and measurement outcomes, one must consider the probability space
$X = \bigcup_{i \in \mathsf{I}} G/G_i$, where $G/G_i$ denotes the quotient of
$G$ with respect to the equivalence relation "$g \sim g'$ if $g' = g h$ for
some $h \in G_i$". This more general case will be treated in Section 5.

Definition 1 (covariant POVMs). Let $X$ be a probability space supporting the
group action $g : x \in X \mapsto gx \in X$. A POVM is covariant [5] if it
satisfies the property

(5)  $P(B) = U_g^\dagger P(gB) U_g \quad \forall B \in \sigma(X), \ \forall g \in G$ ,

where $gB := \{gx \mid x \in B\}$.

In the case $X = \mathsf{I} \times G$, it is simple to prove [?] that any
covariant POVM admits an operator density $M(i,g)$ with respect to the
(normalized) Haar measure $dg$ on the group $G$, namely, if $B = (i, A)$, where
$A \subseteq G$ is a measurable subset, then $P(B) = \int_A dg \, M(i,g)$.
Moreover, such an operator density has necessarily the form [?]

(6)  $M(i,g) = U_g A_i U_g^\dagger$ ,

where $A_i \in \mathcal{B}(\mathcal{H})$ are Hermitian operators satisfying the
constraints

(7)  $A_i \ge 0 \quad \forall i \in \mathsf{I}$

(8)  $\sum_{i \in \mathsf{I}} \int_G dg \, U_g A_i U_g^\dagger = \mathbb{1}$ .

Here and throughout the paper we adopt for the Haar measure the normalization

(9)  $\int_G dg = 1$ .

According to the above discussion, any covariant POVM with probability space
$X = \mathsf{I} \times G$ is completely specified by a set of operators
$\{A_i \mid i \in \mathsf{I}\}$ such that both constraints in Eqs. (7) and (8)
are satisfied. Moreover, it turns out to be very useful to represent such a
vector of operators as a single block operator
$A = \bigoplus_{i \in \mathsf{I}} A_i$, acting on an auxiliary Hilbert space
$\mathcal{H}_{aux} = \bigoplus_{i \in \mathsf{I}} \mathcal{W}_i$, where
$\mathcal{W}_i \simeq \mathcal{H} \ \forall i \in \mathsf{I}$. In terms of the
block operator $A \in \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{W}_i)$
the two constraints Eq. (7) and Eq. (8) become

(10)  $A \ge 0$

and

(11)  $\mathcal{L}(A) = \mathbb{1}$ ,

where $\mathcal{L} : \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{W}_i) \to \mathcal{B}(\mathcal{H})$
is the linear map

(12)  $\mathcal{L}(A) = \sum_{i \in \mathsf{I}} \int_G dg \, U_g A_i U_g^\dagger$ .

The two constraints (10) and (11) define a convex subset of the space of block
operators $\bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{W}_i)$, which is
in one-to-one affine correspondence with the convex set of covariant POVMs. In
the following, the convex set of block operators will be denoted by
$\mathcal{C}$.

Proposition 1. The convex set $\mathcal{C}$, defined by the constraints (10)
and (11), is compact in the operator norm.

Proof. Since $\mathcal{C}$ is a subset of a finite dimensional vector space, it
is enough to show that $\mathcal{C}$ is bounded and closed. $\mathcal{C}$ is
bounded, since for any $A \in \mathcal{C}$ one has
$\|A\| \le \mathrm{Tr}[A] = \sum_{i \in \mathsf{I}} \mathrm{Tr}[A_i] = \mathrm{Tr}[\mathcal{L}(A)] = d$,
with $d = \dim \mathcal{H}$ (using Eqs. (10) and (11)). Moreover, $\mathcal{C}$
is closed. In fact, if $\{A_n\}$ is a Cauchy sequence of points in
$\mathcal{C}$, then $A_n$ converges to some block operator
$A \in \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{W}_i)$. We claim that
$A$ belongs to $\mathcal{C}$. Of course, $A$ satisfies condition (10). As
regards condition (11), just notice that the map $\mathcal{L}$ is continuous,
being linear. Therefore, we have
$\|\mathcal{L}(A) - \mathbb{1}\| = \|\mathcal{L}(A - A_n)\| \to 0$, namely $A$
satisfies condition (11). ■

Observation 1. Since the convex set $\mathcal{C}$ is compact, it coincides with
the convex hull of its extreme points, i.e. any element $A \in \mathcal{C}$ can
be written as a convex combination of extreme points. The classification of the
extreme points of $\mathcal{C}$ will be given in Section 4.

Observation 2. In this section and throughout the paper, $G$ is assumed to be a
compact Lie group. Nevertheless, all results clearly hold also if $G$ is a
finite group, with cardinality $|G|$. In this case, one only has to make the
substitution $\int_G dg \to \frac{1}{|G|} \sum_{g \in G}$. Moreover, since now
the probability space $X = \mathsf{I} \times G$ is discrete, there is no need
to introduce any operator density, and we simply have

(13)  $P(i,g) = \frac{1}{|G|} U_g A_i U_g^\dagger$ .

An example of covariant POVM with a finite symmetry group will be given in
Section 6.
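
The finite-group form (13) is easy to explore numerically. The sketch below is an illustration under stated assumptions, not a construction taken from the paper: the group is $\mathbb{Z}_2$ represented on a qubit by $\{\mathbb{1}, \sigma_z\}$, the index set has two elements, and the names `twirl_map` and `covariant_povm` as well as the weight `w` are hypothetical.

```python
import numpy as np

def twirl_map(blocks, unitaries):
    """Finite-group version of Eq. (12): sum_i (1/|G|) sum_g U_g A_i U_g^dagger."""
    total = sum(U @ A @ U.conj().T for A in blocks for U in unitaries)
    return total / len(unitaries)

def covariant_povm(blocks, unitaries):
    """Finite-group POVM of Eq. (13): P(i, g) = U_g A_i U_g^dagger / |G|."""
    size = len(unitaries)
    return {(i, g): U @ A @ U.conj().T / size
            for i, A in enumerate(blocks)
            for g, U in enumerate(unitaries)}

# Assumed example: G = Z_2 acting on a qubit through {identity, sigma_z}.
identity = np.eye(2)
sigma_z = np.diag([1.0, -1.0])
group = [identity, sigma_z]

# Two orbits: A_1 = 2w |+><+| and A_2 = (1 - w) * identity, with 0 <= w <= 1.
w = 0.4
plus = np.array([1.0, 1.0]) / np.sqrt(2)
blocks = [2 * w * np.outer(plus, plus), (1 - w) * identity]

povm = covariant_povm(blocks, group)
```

Here the normalization constraint holds because averaging over $\{\mathbb{1}, \sigma_z\}$ removes the off-diagonal part of each $A_i$, and the diagonals of the two blocks add up to the identity.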

3. Some results of elementary group theory 

Let $G$ be a compact Lie group and let $dg$ be the invariant Haar measure on
$G$, normalized such that $\int_G dg = 1$. Consider a finite dimensional
Hilbert space $\mathcal{H}$ and represent $G$ on $\mathcal{H}$ by a unitary
(generally projective) representation $R(G) = \{U_g \mid g \in G\}$. The
collection of equivalence classes of irreducible representations which show up
in the decomposition of $R(G)$ will be denoted by $S$. Then $\mathcal{H}$ can
be decomposed into the direct sum of orthogonal irreducible subspaces:

(14)  $\mathcal{H} = \bigoplus_{\mu \in S} \bigoplus_{k=1}^{m_\mu} \mathcal{H}_k^\mu$ ,

where the index $\mu$ labels equivalence classes of irreducible representations
(irreps), while the index $k$ is a degeneracy index labeling the $m_\mu$
different equivalent representations in the class $\mu$. Subspaces carrying
equivalent irreps all have the same dimension $d_\mu$ and are connected by
invariant isomorphisms, namely for any $k, l = 1, \dots, m_\mu$ there is an
operator $T_{kl}^\mu \in \mathcal{B}(\mathcal{H})$ such that
$\mathrm{Supp}(T_{kl}^\mu) = \mathcal{H}_l^\mu$,
$\mathrm{Rng}(T_{kl}^\mu) = \mathcal{H}_k^\mu$, and
$[T_{kl}^\mu, U_g] = 0 \ \forall g \in G$. Due to Schur's lemmas, any operator
$O$ in the commutant of the representation $R(G)$ has the form:

(15)  $O = \sum_{\mu \in S} \sum_{k,l=1}^{m_\mu} \frac{\mathrm{Tr}[T_{lk}^\mu O]}{d_\mu} \, T_{kl}^\mu$ .

Using the above formula, the normalization of a covariant POVM, given by
Eq. (11), can be rewritten in a simple form. In fact, due to the invariance of
the Haar measure $dg$, we have $[\mathcal{L}(A), U_g] = 0 \ \forall g \in G$,
i.e. $\mathcal{L}(A)$ belongs to the commutant of $R(G)$. Then, by exploiting
Eq. (15), we rewrite the normalization constraint as

(16)  $\sum_{i \in \mathsf{I}} \mathrm{Tr}[T_{kl}^\mu A_i] = d_\mu \delta_{kl} \quad \forall \mu \in S, \ \forall k,l = 1, \dots, m_\mu$ ,

$\delta_{kl}$ denoting the Kronecker delta.

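Again, this condition can be recast into a compact form by introducing the
auxiliary Hilbert space
$\mathcal{H}_{aux} = \bigoplus_{i \in \mathsf{I}} \mathcal{W}_i$, with
$\mathcal{W}_i \simeq \mathcal{H} \ \forall i \in \mathsf{I}$, and constructing
a block operator as a repeated direct sum of the same operator $T_{kl}^\mu$,
i.e.

(17)  $S_{kl}^\mu = \bigoplus_{i \in \mathsf{I}} T_{kl}^\mu$ .

With this definition, Eq. (16) becomes

(18)  $\mathrm{Tr}[S_{kl}^\mu A] = d_\mu \delta_{kl} \quad \forall \mu \in S, \ \forall k,l = 1, \dots, m_\mu$ ,

where $A$ is the block operator $A = \bigoplus_{i \in \mathsf{I}} A_i$.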
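
For a finite group, two facts used in this section can be checked numerically: the group average of any operator lies in the commutant of $R(G)$, and the dimension of the commutant, $\sum_{\mu \in S} m_\mu^2$, equals $\frac{1}{|G|} \sum_{g \in G} |\mathrm{Tr}\, U_g|^2$ (a standard character identity, which also holds for projective unitary representations because the phases cancel). The sketch below assumes, purely for illustration, the cyclic shifts of $\mathbb{Z}_3$ on $\mathbb{C}^3$; the function names are hypothetical.

```python
import numpy as np

def twirl(op, unitaries):
    """Group average (1/|G|) sum_g U_g op U_g^dagger, an element of the commutant."""
    return sum(U @ op @ U.conj().T for U in unitaries) / len(unitaries)

def commutant_dimension(unitaries):
    """sum_mu m_mu^2, evaluated as (1/|G|) sum_g |Tr U_g|^2."""
    return float(np.mean([abs(np.trace(U)) ** 2 for U in unitaries]))

# Assumed example: Z_3 acting on C^3 by cyclic shifts |n> -> |n+1 mod 3>.
shift = np.roll(np.eye(3), 1, axis=0)
group = [np.linalg.matrix_power(shift, k) for k in range(3)]

rng = np.random.default_rng(0)
op = rng.normal(size=(3, 3))          # arbitrary operator to be averaged
averaged = twirl(op, group)
dim_commutant = commutant_dimension(group)
```

In this example the representation splits into three inequivalent one-dimensional irreps, so each $m_\mu = 1$ and the commutant has dimension 3.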

4. Extremal covariant POVMs 

This section contains the main result of the paper, namely the characterization
of the extremal covariant POVMs with probability space $\mathsf{I} \times G$.
Such a characterization will be given by exploiting the one-to-one affine
correspondence between the convex set of covariant POVMs and the convex set
$\mathcal{C}$ of block operators defined by the constraints (10) and (11), or,
equivalently, by (10) and (18).

Definition 2. A Hermitian block operator $P = \bigoplus_{i \in \mathsf{I}} P_i$
is a perturbation of $A \in \mathcal{C}$ if there exists an $\epsilon > 0$ such
that $A + tP \in \mathcal{C}$ for any $t \in [-\epsilon, \epsilon]$.

Clearly, a point $A \in \mathcal{C}$ is extreme if and only if it admits only
the trivial perturbation $P = 0$.

Lemma 1. A block operator $P = \bigoplus_{i \in \mathsf{I}} P_i$ is a
perturbation of $A \in \mathcal{C}$ if and only if

(19)  $\mathrm{Supp}(P) \subseteq \mathrm{Supp}(A)$

(20)  $\mathrm{Tr}[S_{kl}^\mu P] = 0 \quad \forall \mu \in S, \ \forall k,l = 1, \dots, m_\mu$ .

Proof. Condition (19) is equivalent to the existence of an $\epsilon > 0$ such
that $A + tP \ge 0$ for all $t \in [-\epsilon, \epsilon]$ (see Lemma 1 of
Ref. [7]). On the other hand, condition (20) is equivalent to requiring that
$A + tP$ satisfies the normalization constraint (16) for all
$t \in [-\epsilon, \epsilon]$. ■

Observation. Note that, due to the block form of both $P$ and $A$, condition
(19) is equivalent to

(21)  $\mathrm{Supp}(P_i) \subseteq \mathrm{Supp}(A_i) \quad \forall i \in \mathsf{I}$ .

Using the previous lemma, we can obtain a first characterization of extremality:

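Theorem 1 (Minimal support condition). A point $A \in \mathcal{C}$ is extremal
if and only if for any $B \in \mathcal{C}$,

(22)  $\mathrm{Supp}(B) \subseteq \mathrm{Supp}(A) \implies A = B$ .

Proof. Suppose $A$ is extremal. Then, if
$\mathrm{Supp}(B) \subseteq \mathrm{Supp}(A)$, according to Lemma 1,
$P = A - B$ is a perturbation of $A \in \mathcal{C}$: its support lies in
$\mathrm{Supp}(A)$, and it satisfies Eq. (20) because both $A$ and $B$ satisfy
Eq. (18). Then $P$ must be zero. Vice versa, if $P$ is a perturbation of $A$,
then $B = A + tP$ is an element of $\mathcal{C}$ for some $t \neq 0$. Due to
Lemma 1 we have $\mathrm{Supp}(B) \subseteq \mathrm{Supp}(A)$. Then, condition
(22) implies $B = A + tP = A$, i.e. $P = 0$. Therefore, $A$ is extremal. ■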

Corollary 1. If $A \in \mathcal{C}$ and $\mathrm{rank}(A) = 1$, then $A$ is
extremal.

Proof. Since $\mathrm{rank}(A) = 1$, for any $B \in \mathcal{C}$ the condition
$\mathrm{Supp}(B) \subseteq \mathrm{Supp}(A)$ implies $B = \lambda A$ for some
$\lambda > 0$. Moreover, since both $A$ and $B$ are in $\mathcal{C}$, from
Eq. (18) we have
$d_\mu = \mathrm{Tr}[S_{kk}^\mu B] = \lambda \, \mathrm{Tr}[S_{kk}^\mu A] = \lambda d_\mu$,
whence necessarily $\lambda = 1$. Condition (22) then ensures that $A$ is
extremal. ■

A deeper characterization of extremal covariant POVMs can be obtained by 
using the following lemma. 

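Lemma 2. Let $A$ be a point of $\mathcal{C}$, represented as

(23)  $A = \bigoplus_{i \in \mathsf{I}} X_i^\dagger X_i$ ,

and define $\mathcal{H}_i = \mathrm{Rng}(X_i)$ the range of $X_i$. A block
operator $P = \bigoplus_{i \in \mathsf{I}} P_i$ is a perturbation of $A$ if and
only if

(24)  $P_i = X_i^\dagger Q_i X_i \quad \forall i \in \mathsf{I}$ ,

for some Hermitian $Q_i \in \mathcal{B}(\mathcal{H}_i)$, and

(25)  $\sum_{i \in \mathsf{I}} \mathrm{Tr}[T_{kl}^\mu X_i^\dagger Q_i X_i] = 0 \quad \forall \mu \in S, \ \forall k,l = 1, \dots, m_\mu$ .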

Proof. First of all, the form (24) is equivalent to condition (19). In fact, if
$P$ has the form (24), then clearly
$\mathrm{Supp}(P) \subseteq \mathrm{Supp}(A)$. Vice versa, if we assume
condition (19) and write $P = \bigoplus_{i \in \mathsf{I}} P_i$, we have
necessarily
$\mathrm{Supp}(P_i) \subseteq \mathrm{Supp}(X_i^\dagger X_i) = \mathrm{Supp}(X_i)$.
Exploiting the singular value decomposition
$X_i = \sum_{n=1}^{r_i} \lambda_n |w_n\rangle\langle v_n|$, where
$\{|v_n\rangle\}$ and $\{|w_n\rangle\}$ are orthonormal bases for
$\mathrm{Supp}(X_i)$ and $\mathrm{Rng}(X_i)$ respectively, we have that any
Hermitian operator $P_i$ satisfying
$\mathrm{Supp}(P_i) \subseteq \mathrm{Supp}(X_i)$ has the form
$P_i = \sum_{m,n} p_{mn} |v_m\rangle\langle v_n|$, whence it can be written as
$P_i = X_i^\dagger Q_i X_i$ for some suitable Hermitian operator
$Q_i \in \mathcal{B}(\mathrm{Rng}(X_i))$. Once the equivalence between the form
(24) and condition (19) is established, relation (25) follows directly from
Eq. (20). ■

Observation. According to the previous lemma, a perturbation of $A$ is
completely specified by a set of Hermitian operators
$\{Q_i \in \mathcal{B}(\mathcal{H}_i) \mid i \in \mathsf{I}\}$, where
$\mathcal{H}_i = \mathrm{Rng}(X_i)$. Such operators can be cast into a single
block operator $Q \in \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$
by defining

(26)  $Q = \bigoplus_{i \in \mathsf{I}} Q_i$ .

In terms of the block operator $Q$ we have the following:

Lemma 3. Let $A = \bigoplus_{i \in \mathsf{I}} X_i^\dagger X_i$ be a point of
$\mathcal{C}$. Define the block operators

(27)  $F_{kl}^\mu = \bigoplus_{i \in \mathsf{I}} X_i T_{kl}^\mu X_i^\dagger$ .

Then $A$ admits a nontrivial perturbation if and only if there exists a nonzero
Hermitian block operator
$Q \in \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$ such that

(28)  $\mathrm{Tr}[F_{kl}^\mu Q] = 0 \quad \forall \mu \in S, \ \forall k,l = 1, \dots, m_\mu$ .

Proof. Using the definition of $F_{kl}^\mu$ and the cyclic property of the
trace, it is immediate to see that Eq. (28) is equivalent to Eq. (25). ■

The previous lemma enables us to characterize the extremal points of C. 

Theorem 2 (Spanning set condition). Let
$A = \bigoplus_{i \in \mathsf{I}} X_i^\dagger X_i$ be a point of $\mathcal{C}$,
and let $F = \{F_{kl}^\mu \mid \mu \in S, \ k,l = 1, \dots, m_\mu\}$ be the set
of block operators defined in Lemma 3. Then, $A$ is extremal if and only if

(29)  $\mathrm{Span}(F) = \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$ ,

where $\mathcal{H}_i = \mathrm{Rng}(X_i)$.

Proof. $A$ is extremal iff it admits only the trivial perturbation $P = 0$.
Equivalently, due to Lemma 3, $A$ is extremal iff the only Hermitian operator
$Q \in \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$ that satisfies
Eq. (28) is the null operator $Q = 0$. Let us decompose the Hilbert space
$\mathcal{K} = \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$ as
$\mathcal{K} = \mathrm{Span}(F) \oplus \mathrm{Span}(F)^\perp$, where $\perp$
denotes the orthogonal complement with respect to the Hilbert-Schmidt product
$(A, B) = \mathrm{Tr}[A^\dagger B]$. Then, $A$ is extremal iff the only
Hermitian operator in $\mathrm{Span}(F)^\perp$ is the null operator. This is
equivalent to the condition $\mathrm{Span}(F)^\perp = \{0\}$, i.e.
$\mathcal{K} = \mathrm{Span}(F)$. ■
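
The spanning set condition lends itself to a simple rank computation. The sketch below is a hedged illustration rather than the authors' procedure: it assumes that the operators $X_i$ and the full list of invariant isomorphisms $T_{kl}^\mu$ are supplied as matrices, and that the normalization (18) already holds. Since every $X_i T_{kl}^\mu X_i^\dagger$ is supported on $\mathrm{Rng}(X_i)$, the span equals $\bigoplus_i \mathcal{B}(\mathcal{H}_i)$ exactly when its dimension is $\sum_i r_i^2$. The function name `is_extremal` and the $\mathbb{Z}_2$ example are assumptions.

```python
import numpy as np

def is_extremal(Xs, Ts, tol=1e-9):
    """Spanning-set test for A = (+)_i X_i^dagger X_i, assuming A is already in C.

    Xs: list of matrices X_i; Ts: list of all invariant operators T^mu_kl.
    """
    ranks = [np.linalg.matrix_rank(X, tol=tol) for X in Xs]
    # One row per operator F = (+)_i X_i T X_i^dagger, flattened block by block.
    rows = [np.concatenate([(X @ T @ X.conj().T).ravel() for X in Xs]) for T in Ts]
    span_dim = np.linalg.matrix_rank(np.array(rows), tol=tol)
    return bool(span_dim == sum(r * r for r in ranks))

# Assumed example: G = Z_2 on a qubit via {identity, sigma_z}. The two irreps
# are inequivalent and one-dimensional, so the T operators are two projectors.
P0, P1 = np.diag([1.0, 0.0]), np.diag([0.0, 1.0])
Ts = [P0, P1]

extremal_rank_one = is_extremal([np.array([[1.0, 1.0]])], Ts)   # A = 2|+><+|
extremal_identity = is_extremal([np.eye(2)], Ts)                 # A = identity
extremal_two_orbits = is_extremal(                               # A_1 = |0><0|, A_2 = |1><1|
    [np.array([[1.0, 0.0]]), np.array([[0.0, 1.0]])], Ts)
```

In this example the rank-one point is extremal, in agreement with Corollary 1, while the identity is not: it is the equal mixture of $2|{+}\rangle\langle{+}|$ and $2|{-}\rangle\langle{-}|$. The last case is an extremal point with two nonzero orbits.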

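Corollary 2. Let $A = \bigoplus_{i \in \mathsf{I}} X_i^\dagger X_i$ be a point
of $\mathcal{C}$, and define $r_i = \mathrm{rank}\, X_i$. If $A$ is extremal,
then the following relation holds:

(30)  $\sum_{i \in \mathsf{I}} r_i^2 \le \sum_{\mu \in S} m_\mu^2$ .

Proof. For an extreme point of $\mathcal{C}$, relation (29) implies that the
cardinality of the set $F$ is at least the dimension of
$\mathcal{K} = \bigoplus_{i \in \mathsf{I}} \mathcal{B}(\mathcal{H}_i)$. Then,
the upper bound (30) follows from
$\dim \mathcal{K} = \sum_{i \in \mathsf{I}} r_i^2$ and from the fact that
$|F| = \sum_{\mu \in S} m_\mu^2$. ■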

Observation. If the group representation $R(G)$ is irreducible, then its
Clebsch-Gordan decomposition contains only one term $\bar\mu$ with multiplicity
$m_{\bar\mu} = 1$. Then, bound (30) becomes
$\sum_{i \in \mathsf{I}} r_i^2 \le 1$, namely for an extremal
$A = \bigoplus_{i \in \mathsf{I}} A_i$, one has necessarily
$\mathrm{rank}(A_{i_0}) = 1$ for some $i_0 \in \mathsf{I}$, and $A_i = 0$ for
any $i \neq i_0$ (this is also a sufficient condition, due to Corollary 1). In
terms of the corresponding covariant POVM $M(i,g) = U_g A_i U_g^\dagger$, one
has $M(i,g) = 0$ for any $i \neq i_0$, i.e. these operators correspond to
events in the probability space that never occur.

5. Extremal covariant POVMs in the presence of nontrivial stability groups

In the previous section, we obtained a characterization of extremal covariant
POVMs whose probability space is $X = \mathsf{I} \times G$ for some finite
index set $\mathsf{I}$. The framework we outlined is suitable for a
straightforward generalization to the case
$X = \bigcup_{i \in \mathsf{I}} G/G_i$, where $G_i$ are compact subgroups of
$G$.

In this case, it is possible to show that a covariant POVM $P$ admits a density
$M(x_i)$ such that for any measurable subset $B \subseteq G/G_i$ one has
$P(B) = P_i(B) = \int_B dx_i \, M(x_i)$, where $dx_i$ is the group invariant
measure on $G/G_i$. The form of the operator density is now

(31)  $M(x_i) = U_{g_i(x_i)} A_i U_{g_i(x_i)}^\dagger$ ,

where $A_i \ge 0$, and $g_i(x_i) \in G$ is any representative element of the
equivalence class $x_i \in G/G_i$. The normalization of the POVM is still given
by Eq. (16). In addition, in order to remove the dependence of $M(x_i)$ on the
choice of the representative $g_i(x_i)$, each operator $A_i$ must satisfy the
relation

(32)  $[A_i, U_h] = 0 \quad \forall h \in G_i$ .

The commutation constraint (32) can be simplified by decomposing each
representation $R(G_i) = \{U_h \mid h \in G_i\}$ into irreps

(33)  $U_h = \bigoplus_{\nu_i \in S_i} U_h^{\nu_i} \otimes \mathbb{1}_{m_{\nu_i}}$ ,

where $m_{\nu_i}$ denotes the multiplicity of the irrep $\nu_i$, and $S_i$
denotes the collection of all irreps contained in the decomposition of
$R(G_i)$. This corresponds to the decomposition of the Hilbert space
$\mathcal{H}$ as

(34)  $\mathcal{H} = \bigoplus_{\nu_i \in S_i} \mathcal{H}_{\nu_i} \otimes \mathbb{C}^{m_{\nu_i}}$ ,

where $\mathcal{H}_{\nu_i}$ is a representation space, supporting the irrep
$\nu_i$, and $\mathbb{C}^{m_{\nu_i}}$ is a multiplicity space. In this
decomposition, the commutation relation (32) is equivalent to the block form

(35)  $A_i = \bigoplus_{\nu_i \in S_i} \mathbb{1}_{\nu_i} \otimes A_{i,\nu_i}$ ,

where $A_{i,\nu_i} \ge 0$ are operators acting on the multiplicity space
$\mathbb{C}^{m_{\nu_i}}$.
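
For a finite stability group, the commutation constraint (32) can be enforced numerically by averaging over the subgroup, which projects any positive operator onto the commutant of $R(G_i)$. The sketch below is an illustration under assumptions: the subgroup is taken to be the $\mathbb{Z}_3$ of diagonal phase ("clock") unitaries on $\mathbb{C}^3$, whose irreps are one-dimensional and pairwise inequivalent, so the block form (35) reduces to a diagonal matrix. The helper names are hypothetical.

```python
import numpy as np

def commutes_with_all(op, unitaries, tol=1e-10):
    """True if [op, U_h] = 0 for every listed unitary U_h."""
    return all(np.allclose(op @ U, U @ op, atol=tol) for U in unitaries)

def project_onto_commutant(op, unitaries):
    """Average U_h op U_h^dagger over a finite subgroup."""
    return sum(U @ op @ U.conj().T for U in unitaries) / len(unitaries)

d = 3
omega = np.exp(2j * np.pi / d)
clock = np.diag(omega ** np.arange(d))                    # |n> -> omega^n |n>
stabilizer = [np.linalg.matrix_power(clock, q) for q in range(d)]   # assumed G_i

rng = np.random.default_rng(0)
M = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
seed = M @ M.conj().T                                      # random positive operator
A_i = project_onto_commutant(seed, stabilizer)
```

The averaged operator keeps only the diagonal of `seed`, remains positive, and commutes with every element of the assumed stability group.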

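By defining $\omega = (i, \nu_i)$ and $\Omega = \bigcup_{i \in \mathsf{I}} S_i$,
we can introduce an auxiliary Hilbert space, and associate to a covariant POVM
the block operator

(36)  $A = \bigoplus_{\omega \in \Omega} A_\omega$ ,

where $A_\omega = A_{i,\nu_i}$. Furthermore, we define the block operators

(37)  $S_{kl}^\mu = \bigoplus_{\omega \in \Omega} S_{kl\omega}^\mu$ ,

where now
$S_{kl\omega}^\mu = \mathrm{Tr}_{\mathcal{H}_{\nu_i}}[\Pi_{\nu_i} T_{kl}^\mu]$.
Here $\Pi_{\nu_i}$ denotes the projector onto
$\mathcal{H}_{\nu_i} \otimes \mathbb{C}^{m_{\nu_i}}$, and
$\mathrm{Tr}_{\mathcal{H}_{\nu_i}}$ denotes the partial trace over
$\mathcal{H}_{\nu_i}$. With these definitions, the normalization of the POVM,
given by Eq. (16), becomes equivalent to

(38)  $\mathrm{Tr}[S_{kl}^\mu A] = d_\mu \delta_{kl}$ .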

Now we call $\mathcal{D}$ the convex set of block operators
$A = \bigoplus_{\omega \in \Omega} A_\omega$, defined by the two conditions
$A \ge 0$ and Eq. (38). Such a convex set is in one-to-one affine
correspondence with the convex set of covariant POVMs with probability space
$X = \bigcup_{i \in \mathsf{I}} G/G_i$. Since the constraints defining
$\mathcal{D}$ are formally the same as those defining the convex set
$\mathcal{C}$, we can exploit the characterization of extremal points of the
previous section. In particular, Corollary 2 becomes

Corollary 3. Let $A = \bigoplus_{\omega \in \Omega} X_\omega^\dagger X_\omega$
be a point of $\mathcal{D}$, and define
$r_{i,\nu_i} = r_\omega = \mathrm{rank}(X_\omega)$. If $A$ is extremal, then
the following relation holds:

(39)  $\sum_{i \in \mathsf{I}} \sum_{\nu_i \in S_i} r_{i,\nu_i}^2 \le \sum_{\mu \in S} m_\mu^2$ .

Observation. As in the case of Corollary 2, if the representation $R(G)$ is
irreducible, as a consequence of the bound on the ranks one obtains
$\mathrm{rank}(A_{\omega_0}) = 1$ for some $\omega_0 \in \Omega$ and
$A_\omega = 0$ for any $\omega \neq \omega_0$.

6. Applications 

Here we give two examples of the use of the characterization of extremal POVMs 
in the solution of concrete optimization problems. 

6.1. State discrimination with mutually unbiased bases. Here we consider a case
of state discrimination where the set of signal states is the union of two
mutually unbiased bases [?] related by Fourier transform. Precisely, let
$\mathcal{H}$ be a $d$-dimensional Hilbert space, and consider the orthonormal
bases $B_1 = \{|n\rangle \mid n = 0, \dots, d-1\}$ and
$B_2 = \{|e_n\rangle \mid n = 0, \dots, d-1\}$, where
$|e_n\rangle = \frac{1}{\sqrt{d}} \sum_{m=0}^{d-1} \omega^{mn} |m\rangle$,
$\omega = \exp\left(\frac{2\pi i}{d}\right)$. $B_1$ and $B_2$ are mutually
unbiased, namely $|\langle m | e_n \rangle|^2 = 1/d$ for any $m, n$. Consider
the two sets of states defined by
$S_1 = \{\rho_{1n} = |n\rangle\langle n| \mid n = 0, \dots, d-1\}$ and
$S_2 = \{\rho_{2n} = |e_n\rangle\langle e_n| \mid n = 0, \dots, d-1\}$. Now the
problem is to determine with minimum error probability the state of the system,
which is randomly prepared either in a state of $S_1$ with probability $p/d$,
or in a state of $S_2$ with probability $(1-p)/d$.

Exploiting the results of the present paper it is immediate to find the
measurement that minimizes the error probability. In fact, let us consider the
irreducible representation of the group $G = \mathbb{Z}_d \times \mathbb{Z}_d$
given by

(40)  $R(G) = \left\{ U_{pq} = \sum_{n=0}^{d-1} \omega^{qn} |n \oplus p\rangle\langle n| , \ (p,q) \in \mathbb{Z}_d \times \mathbb{Z}_d \right\}$ ,

where $\oplus$ denotes addition modulo $d$. Then, the sets $S_1$ and $S_2$ are
the group orbits of the initial states $\rho_{10}$ and $\rho_{20}$,
respectively. Moreover, the states $\rho_{10}$ and $\rho_{20}$ have nontrivial
stability groups $G_1$ and $G_2$, defined by the unitaries
$R(G_1) = \{U_{0q} \mid q \in \mathbb{Z}_d\}$ and
$R(G_2) = \{U_{p0} \mid p \in \mathbb{Z}_d\}$. Therefore, signal states are in
one-to-one correspondence with points of the probability space
$X = G/G_1 \cup G/G_2$, such points being denoted by couples $(i,n)$ where
$i \in \{1,2\}$ and $n \in \mathbb{Z}_d$. For the discrimination we can
consider without loss of generality a covariant POVM of the form of Eq. (31),
where now the group element $g$ is the couple
$(p,q) \in \mathbb{Z}_d \times \mathbb{Z}_d$.

Moreover, since the probabilities are linear in the POVM, in the minimization
of the error probability we can restrict the attention to extremal covariant
POVMs. Now, the representation $R(G)$ is irreducible, whence Corollary 3
requires either $A_1 = 0$ or $A_2 = 0$ in Eq. (31). This means that either the
states in $S_1$ or the states in $S_2$ are never detected. Moreover, since the
states within a given set, either $S_1$ or $S_2$, are orthogonal, they can be
perfectly distinguished among themselves. Therefore, the optimal POVM is
$P^{(1)}(i,n) = \delta_{i1} |n\rangle\langle n|$ if $p > 1/2$, and
$P^{(2)}(i,n) = \delta_{i2} |e_n\rangle\langle e_n|$ otherwise. In particular,
if $p = 1/2$, an experimenter who tries to discriminate states of two Fourier
transformed bases cannot do anything better than randomly choosing one of the
orthogonal measurements $P^{(1)}$ and $P^{(2)}$. This is the working principle
of the BB84 cryptographic protocol [12].

GIULIO CHIRIBELLA AND GIACOMO MAURO D'ARIANO

Observation. The previous result can be easily generalized to a case of state
discrimination with more than two mutually unbiased bases. In fact, if we have
a set of mutually unbiased bases $\{B_i \mid i \in \mathsf{I}\}$ that are all
generated by the irreducible representation
$R(G) = \{U_{pq}, \ (p,q) \in \mathbb{Z}_d \times \mathbb{Z}_d\}$, all
considerations about extremal covariant POVMs still hold. If $S_i$ is the set
of states associated to the basis $B_i$, and $p_i/d$ is the probability of
extracting a state from $S_i$ ($\sum_{i \in \mathsf{I}} p_i = 1$), then the
covariant POVM which discriminates the signal states with minimum error
probability is the orthogonal measurement onto the basis $B_j$ such that
$p_j = \max_{i \in \mathsf{I}} \{p_i\}$. Notice that, if the dimension $d$ of
the Hilbert space $\mathcal{H}$ is a power of some prime number, then there are
$d+1$ MUBs that are generated by the irreducible representation $R(G)$ via the
construction by Wootters and Fields [11].
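
The two-basis example can be verified numerically for a small dimension. The following sketch is illustrative only: the values $d = 3$ and $p = 0.7$, and the helper names `weyl_unitaries`, `success_probability` and `proj`, are assumptions. It builds the unitaries of Eq. (40), checks irreducibility through $\frac{1}{|G|} \sum_{g} |\mathrm{Tr}\, U_g|^2 = 1$, and evaluates the success probability of the two orthogonal measurements.

```python
import numpy as np

def weyl_unitaries(d):
    """Return {(p, q): U_pq} with U_pq = sum_n omega^(q n) |n+p mod d><n|."""
    omega = np.exp(2j * np.pi / d)
    shift = np.roll(np.eye(d), 1, axis=0)             # |n> -> |n+1 mod d>
    clock = np.diag(omega ** np.arange(d))            # |n> -> omega^n |n>
    return {(p, q): np.linalg.matrix_power(shift, p) @ np.linalg.matrix_power(clock, q)
            for p in range(d) for q in range(d)}

def success_probability(povm, states, priors):
    """sum_x prior_x Tr[rho_x P_x], reading outcome x as the guess 'state x'."""
    return sum(w * np.trace(rho @ P).real for P, rho, w in zip(povm, states, priors))

def proj(v):
    """Rank-one projector |v><v|."""
    return np.outer(v, v.conj())

d, p = 3, 0.7                                         # illustrative values
U = weyl_unitaries(d)
fourier = np.exp(2j * np.pi * np.outer(np.arange(d), np.arange(d)) / d) / np.sqrt(d)
basis1 = [np.eye(d)[:, n] for n in range(d)]          # |n>
basis2 = [fourier[:, n] for n in range(d)]            # |e_n>

states = [proj(v) for v in basis1] + [proj(v) for v in basis2]
priors = [p / d] * d + [(1 - p) / d] * d
zero = np.zeros((d, d))
povm1 = [proj(v) for v in basis1] + [zero] * d        # measure in B_1 only
povm2 = [zero] * d + [proj(v) for v in basis2]        # measure in B_2 only

# Equals sum_mu m_mu^2; the value 1 signals an irreducible representation.
irreducibility = np.mean([abs(np.trace(V)) ** 2 for V in U.values()])
```

With these values, measuring in $B_1$ succeeds with probability $p$ and measuring in $B_2$ with probability $1 - p$, so the better of the two has error probability $1 - \max\{p, 1-p\}$.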

6.2. Maximization of the mutual information. A frequent problem in quantum
communication is to find the POVM $P_i$, $i \in \mathsf{I}$, that maximizes the
mutual information with a given set of signal states
$S = \{\rho_j \mid j \in \mathsf{J}\}$. Denoting by $p_j$ the probability of
the signal state $\rho_j$, by
$q_i = \sum_{j \in \mathsf{J}} p_j \mathrm{Tr}[P_i \rho_j]$ the overall
probability of the outcome $i$, and by $p_{ij} = p_j \mathrm{Tr}[P_i \rho_j]$
the joint probability of the outcome $i$ with the state $\rho_j$, the mutual
information is defined as

(41)  $I = H(\{p_j\}) + H(\{q_i\}) - H(\{p_{ij}\})$ ,

where $H(\{p_i\}) = -\sum_i p_i \log(p_i)$ is the Shannon entropy. As in the
minimization of a Bayes cost [?], when the set of signal states is invariant
under the action of some finite group $G$ and all states in the same group
orbit have the same probability, one can without loss of generality restrict
the search for the optimal POVM to covariant POVMs with probability space
$X = \mathsf{I} \times G$, for some finite index set $\mathsf{I}$ [?]. However,
differently from the case of state discrimination, the points of the
probability space do not need to be in one-to-one correspondence with the
signal states. Therefore, the set $\mathsf{I}$ is not specified a priori.
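
As a numerical illustration, the mutual information can be evaluated in the equivalent form $\sum_{i,j} p_{ij} \log_2 [p_{ij} / (q_i p_j)]$. The sketch below is a minimal example under assumptions: the logarithm is taken in base 2, and the two orthogonal qubit states, the two measurements, and the function name `mutual_information` are hypothetical choices made only for this illustration.

```python
import numpy as np

def mutual_information(priors, states, povm):
    """Mutual information (in bits) between signal index j and outcome i."""
    joint = np.array([[w * np.trace(P @ rho).real for rho, w in zip(states, priors)]
                      for P in povm])                  # joint[i, j] = p_ij
    q = joint.sum(axis=1, keepdims=True)               # outcome probabilities q_i
    p = joint.sum(axis=0, keepdims=True)               # signal probabilities p_j
    product = q @ p                                    # product[i, j] = q_i p_j
    mask = joint > 1e-15                               # terms with p_ij = 0 contribute 0
    return float(np.sum(joint[mask] * np.log2(joint[mask] / product[mask])))

def proj(v):
    """Rank-one projector |v><v|."""
    return np.outer(v, v.conj())

ket0, ket1 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
plus, minus = (ket0 + ket1) / np.sqrt(2), (ket0 - ket1) / np.sqrt(2)

states = [proj(ket0), proj(ket1)]                      # assumed signal states
priors = [0.5, 0.5]
info_z = mutual_information(priors, states, [proj(ket0), proj(ket1)])
info_x = mutual_information(priors, states, [proj(plus), proj(minus)])
```

Measuring in the basis of the signal states recovers one full bit, whereas measuring in the unbiased basis $\{|{+}\rangle, |{-}\rangle\}$ yields no information.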

Combining our characterization of extremal covariant POVMs with the following
basic properties of the mutual information (for the proofs, see Ref. [13]), we
can readily obtain a bound on the cardinality of the index set $\mathsf{I}$.

Property 1. The mutual information is a convex functional of the POVM. 

Property 2. In the maximization of the mutual information, one can consider
without loss of generality POVMs made of rank-one operators.

Consider a covariant POVM $P(i,g) = \frac{1}{|G|} U_g A_i U_g^\dagger$. Due to
Property 1, in the maximization of the mutual information we can consider
extremal covariant POVMs. Then, from Corollary 2 we have the bound
$\sum_{i \in \mathsf{I}} \mathrm{rank}(A_i)^2 \le \sum_{\mu \in S} m_\mu^2$.
Due to Property 2, this also implies that the number of (rank-one) operators
$A_i$ cannot exceed $\sum_{\mu \in S} m_\mu^2$. Therefore, we can assume without
loss of generality

(42)  $|\mathsf{I}| \le \sum_{\mu \in S} m_\mu^2$ .

This provides an alternative derivation of the bound given in Ref. [?].
Finally, if the representation $R(G)$ is irreducible, the bound gives
$|\mathsf{I}| = 1$, namely the probability space is $X \simeq G$, according to
the classic result of [?].